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ABSTRACT 

This is the first of a series of papers in which we derive simuhaneous constraints on 
cosmological parameters and X-ray scaling relations using observations of the growth 
of massive, X-ray flux-sclcctcd galaxy clusters. Our data set consists of 238 cluster 
detections from the i?(9S'^T All-Sky Survey, and incorporates follow-up observations 
of 94 of those clusters using the Chandra X-ray Observatory or ROSAT. Here we 
describe and implement a new statistical framework required to self-consistently pro- 
duce simultaneous constraints on cosmology and scaling relations from such data, and 
present results on models of dark energy. In spatially flat models with a constant dark 
energy equation of state, w, the cluster data yield fim = 0.23 ± 0.04, erg — 0.82 ± 0.05, 
and w = —1.01 ± 0.20, incorporating standard priors on the Hubble parameter and 
mean baryon density of the Universe, and marginalizing over conservative allowances 
for systematic uncertainties. These constraints agree well and are competitive with 
independent data in the form of cosmic microwave background anisotropics, type 
la supernovae, cluster gas mass fractions, baryon acoustic oscillations, galaxy red- 
shift surveys, and cosmic shear. The combination of our data with current microwave 
background, supernova, gas mass fraction, and baryon acoustic oscillation data yields 
rJni = 0.27 ±0.02, (Tg = 0.79 ±0.03, and w = -0.96 ±0.06 for flat, constant w models. 
The combined data also allow us to investigate evolving w models. Marginalizing over 
transition rcdshifts in the range 0.05-1, we constrain the equation of state at late and 
early times to be respectively wq = — 0.88±0.21 and w^t = — l-05^Q'3g, again including 
conservative systematic allowances. The combined data provide constraints equivalent 
to a Dark Energy Task Force figure of merit of 15.5. Our results highlight the power 
of X-ray studies, which enable the straightforward production of large, complete, and 
pure cluster samples and admit tight scaling relations, to constrain cosmology. How- 
ever, the new statistical framework we apply to this task is equally applicable to cluster 
studies at other wavelengths. 

Key words: cosmology: observations - cosmological parameters - large-scale struc- 
ture of Universe - X-rays: galaxies: clusters. 



1 INTRODUCTION 

Clusters of galaxies have a long history as cosmological 
laboratories, beginning with the discovery of dark matter 
(Zwicky 1937) through studies of cluster galaxy orbits, and 
providing early evidence for a low matter density universe 
(White et al. 1993) through studies of their baryonic and 
dark matter content. These breakthroughs rested on obser- 
vations of the properties of individual systems, but the pop- 
ulation of clusters as a whole also contains a great deal of 
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cosmological information. Clusters represent the most mas- 
sive gravitationally bound systems in the Universe, and as 
such their abundance probes the amount of structure in the 
Universe and its growth over cosmic time. 

The local cluster population has been used to jointly 
constrain the average density of matter, fim, and the ampli- 
tude of density perturbations, as, (recently by Henry et al. 
2009; Rozo et al. 2010, but see also Reiprich & Bohringer 
2002; Allen et al. 2003; Schuecker et al. 2003 and additional 
references in Mantz et al. 2008, hereafter M08). Most re- 
cently, the construction of X-ray flux-limited cluster samples 
out to redshift z — 0.5 and beyond has enabled studies of 
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the growth of structure (M08; Vikhlinin et al. 2009b). Such 
studies provide an important new window on cosmology, as 
various dark energy and modified gravity models designed 
to explain the acceleration of the cosmic expansion are po- 
tentially distinguishable by their effects on structure forma- 
tion (e.g. Rapetti et al. 2009a, b; Schmidt, Vikhlinin, & Hu 
2009). 

The key ingredients for investigations of the cluster pop- 
ulation are (1) a cluster survey with a well understood se- 
lection function,^ and (2) scaling relations that link cluster 
mass to observable quantities. Currently, the most successful 
approach to finding massive clusters over a range of redshifts 
is through the X-ray emission of the hot intracluster gas, al- 
though Sunyaev-Zel'dovich and optical surveys are also mak- 
ing significant progress. Because Malmquist and Eddington 
biases are ubiquitous in current X-ray fiux-limited samples, 
great care is warranted in their analysis. In particular, pre- 
cise, robust cosmological constraints can only be obtained by 
simultaneously fitting the X-ray luminosity-mass relation, 
as we do here. Conversely, rigorous analysis of the scaling 
relations must take into account the cluster mass function 
and the selection function of the data set, as discussed by 
Mantz et al. (2009, hereafter Paper II; see also Stanek et al. 
2006; Pacaud et al. 2007). 

While a flux- limited sample, combined with more de- 
tailed follow-up observations of a subset of clusters in that 
sample, may contain the necessary information to provide si- 
multaneous constraints on cosmology and scaling relations, 
a fully self-consistent statistical framework for this analy- 
sis has been lacking to this point. By "self-consistent", we 
mean that a single likelihood function applying to the full 
data set (survey -|- follow-up observations) and encompass- 
ing the entire theoretical model (cosmology -\- scaling rela- 
tions) should be derived from first principles, ensuring that 
the covariance among all the model parameters is fully cap- 
tured and that the effects of the mass function and selection 
biases are properly accounted for throughout. 

This is the first of a series of papers in which we ad- 
dress these issues. Paper II contains details of the follow-up 
X-ray observations and their reduction, presents the con- 
straints on scaling relations from our simultaneous analysis, 
and discusses their astrophysical implications. In this paper, 
we present the statistical methods applied to the problems 
described above and the resulting cosmological constraints. 
The dark energy models we address here include the simple 
cosmological constant model; models with a constant dark 
energy equation of state, w\ and simple evolving w models. 
As we will show, our analysis produces some of the tight- 
est constraints on dark energy parameters of any experi- 
ment to date. In Papers III (Rapetti et al. 2009b) and IV 
(Mantz, Allen, & Rapetti 2009), we respectively apply our 
analysis to investigations of modified gravity and neutrino 
properties. 

In Section 2, we briefiy review the cluster data set, 
which is more fully described in Paper II. Section 3 contains 
a full description of the theoretical model fit to the data, 



Throughout this paper, wc use the term "selection function" to 
refer to the probability that a cluster be detected by a survey and 
included in the resulting data set, as a function of that cluster's 
physical properties (redshift, flux etc.). See Section 4.1. 



including the cosmology, cluster mass function, and scaling 
relations. The new, self-consistent analysis method is pre- 
sented in Section 4.1. Sections 5 and 6 respectively contain 
the cosmological results of our analysis and a discussion of 
relevant systematics. 

In this paper, we adopt the conventional definition of 
cluster radius in terms of the critical density of the Universe; 
thus, rA is the radius within which the mean density of 
the cluster is A times the critical density at the cluster's 
redshift, pcr(z). An alternative convention, used particularly 
in the literature relevant to the mass function, is to define 
the overdensity with respect to the mean matter density at 
redshift z. We will consistently use the former convention, 
and so, for example, an overdensity of 300 with respect to 
the matter density will be written as A = 300f2m(2;), where 
Q,m{z) is the ratio of the mean matter density to the critical 
density. 



2 DATA 

The galaxy cluster data used in this work, as well 
as their selection and reduction, are discussed in de- 
tail in Paper II. Using three wide-area cluster samples 
drawn from the ROSAT All-Sky Survey (RASS; Triimper 
1993) - the ROSAT Brightest Cluster Sample (BCS; 
Ebeling et al. 1998), the ROSAT-'ESO Flux-Limited X-ray 
sample (REFLEX; Bohringer et al. 2004), and the bright 
sub-sample of the Massive Cluster Survey (Bright MACS; 
Ebeling, Edge, & Henry 2001; Ebeling et al. 2010) - we se- 
lect a statistically complete sample of 238 X-ray luminous 
clusters covering the redshift range z < 0.5. Of these 238 
clusters, 94 have follow-up Chandra or ROSAT observa- 
tions that we incorporate into the analysis. To distinguish 
it from cluster data used in other cosmological work (e.g. 
optically selected clusters), we refer to this data set as the 
cluster X-ray Luminosity Function (XLF), although in fact 
the data set contains a great deal more information than 
the luminosity function alone. From the follow-up observa- 
tions, we measure X-ray luminosity, average temperature, 
and gas mass within rsoo. The gas mass is used as a proxy 
for total mass, using the finding of Allen et al. (2008, here- 
after A08) that the gas mass fraction, /gas = Mgas/Mtot, 
is a constant for the hot, massive clusters being studied.^ 
The systematic uncertainty associated with the /gas mea- 
surement is accounted for by simultaneously constraining 
the full /gas model of A08, which includes generous sys- 
tematic allowances for instrument calibration, non-thermal 
pressure support, the depletion of baryons in clusters rela- 
tive to the cosmic mean, and evolution in the baryon de- 
pletion and stellar content of clusters. We use only the six 
lowest redshift clusters from A08 {z < 0.15), which are suf- 
ficient to constrain the gas mass fraction at low redshift 
without directly producing a constraint on dark energy by 

^ We prefer to use Afgas as a proxy for total mass rather than 
the thermal energy, Yx = Mga,skT, because Afgas can be mea- 
sured more precisely than temperature for a given exposure time, 
and because Mgas measurements at rsoo a^re minimally affected 
by background and emission weighting uncertainties. In addition, 
Mgas displays exceptionally small scatter with mass (see AOS and 
Paper II) in the mass range considered here, M > 3 X 10^^ ATq. 
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themselves. In our analysis of the XLF data, we also use 
Gaussian priors to constrain the Hubble parameter and the 
mean baryon density, based on the results of the Hubble 
Key project {h = J/o/100 kms"^ Mpc"^ = 0.72 ± 0.08, 
Freedman et al. 2001) and big bang nucleosynthesis studies 
(fib/i^ = 0.0214 ± 0.002, Kirkman et al. 2003). 

In addition, we compare and combine results from our 
own analysis with those of independent cosmological data, 
including the full cluster /gas data set (AOS, 42 clusters at 
z < 1.1), as well as cosmic microwave background (CMB), 
type la supernova (SNIa) and baryon acoustic oscillation 
(BAO) data. Our analysis of the CMB anisotropics uses 5- 
year Wilkinson Microwave Anisotropy Probe ( WMAP) data 
(Hinshaw et al. 2009; Hill et al. 2009; Noha et al. 2009) 
with the March 2008 version of the WMAP likelihood code^ 
(Dunkley et al. 2009). The SNIa results are derived from 
the Union compilation (Kowalski et al. 2008), which in- 
cludes data from a variety of sources (307 SNIa in total; 
Hamuy et al. 1996; Garnavich et al. 1998; Riess et al. 1998, 
1999, 2004, 2007; Schmidt et al. 1998; Perlmutter et al. 
1999; Krisciunas et al. 2001, 2004a,b; Knop et al. 2003; 
Tonry et al. 2003; Harris et al. 2004; Astier et al. 2006; 
Jha et al. 2006; Miknaitis et al. 2007; Wood-Vasey et al. 
2007), including the treatment of systematic errors em- 
ployed by Kowalski et al. Our analysis of BAO data uses 
the constraints on the ratio of the sound horizon to the 
distance scale at z — 0.25 and z = 0.35 derived by 
Percival et al. (2007) from the galaxy correlation function in 
2dF (CoUess et al. 2001, 2003) and Sloan Digital Sky Survey 
(Adelman-McCarthy et al. 2007) data. 

When fitting CMB data, we allow the scalar spectral in- 
dex, ria, and optical depth to reionization, r, to vary as free 
parameters, and marginalize over a plausible range in the 
amplitude of the Sunyaev-Zel'dovich signal due to galaxy 
clusters (0 < Agz < 2; introduced by Spergel et al. 2007). 
The combination of CMB and /gas data places tight con- 
straints on both h and fib/i^ in addition to other parame- 
ters of interest (see A08), so the Hubble Key project and big 
bang nucleosynthesis priors are unnecessary in analyses of 
the combined data sets. This applies as well to the combina- 
tion of CMB and XLF data, since we always use the z < 0.15 
subset of the /gas data to calibrate the cluster mass scale for 
the XLF analysis. 



3 MODEL 

In this section, we detail the various pieces of the model be- 
ing fit to the XLF data. Section 3.1 reviews the important 
cosmological parameters and the dark energy models to be 
addressed, while Section 3.2 focuses on the predicted mass 
function and its cosmological dependence. In Section 3.3, 
we outline the model for the cluster scaling relations. Sec- 
tions 3.4 and 3.5 describe the sampling model, i.e. the model 
for how a set of parameters predicts the measured data, in- 
cluding measurement errors and their correlation, and im- 
plicit dependences on reference cosmological values. The 
model parameters are summarized in Table 1. 
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3.1 Cosmological models 

The simplest cosmological model considered in this study 
contains dark energy in the form of a spatially uniform 
and non-evolving energy density, i.e. a cosmological constant 
(ACDM). In this model, the cosmological parameters rele- 
vant for the growth of structure are the mean baryon density. 
Sib; the mean total matter density. Qui', the Hubble param- 
eter, h; and the matter power spectrum normalization, erg. 
Here the mean densities refer to redshift zero, since their val- 
ues at other times are then determined by the Friedmann 
equation, and o"| is the z = variance in the density field at 
scales of 8h~^ Mpc, defined explicitly in Equation 4 below. 
Because the mass range of our data corresponds to a small 
range in scale, we do not simultaneously fit for the spec- 
tral index of scalar density perturbations, ris, but rather fix 
its value at 0.95 (e.g. Komatsu et al. 2009), except when si- 
multaneously fitting CMB data (see Section 2 and Table 1). 
This assumption does not significantly affect our results; see 
Section 5.1. We assume that the Universe is spatially flat on 
large scales throughout. 

We additionally consider models in which dark energy 
is a fluid parametrized by a constant equation of state, w 
(constant w models). Unlike the cosmological constant sce- 
nario, such a fluid will not in general have uniform density, 
and thus contributes to some degree to the evolution of den- 
sity perturbations, in addition to influencing the expansion 
history of the Universe. Due to theoretical uncertainties on 
the behavior of the dark energy fluid on the nonlinear scales 
that determine the mass function, numerical simulations of 
the mass function have to date been done only for mod- 
els in which dark energy is uniform, even when w 7^ — 1. 
Our approach is to straightforwardly propagate the influ- 
ence of non-uniform dark energy on linear scales but to 
leave the mass function unaltered, while continuing to use 
our standard systematic allowances on the mass function 
and its evolution (Section 3.2). We have verified that the 
value of the dark energy sound speed (assumed to be con- 
stant with time) has no effect on our results. Preliminary 
theoretical work indicates that the effect of dark energy per- 
turbations on the mass function might be readily measur- 
able (Abramo, Batista, & Rosenfeld 2009; Creminelli et al. 
2010; Park et al. 2009; Alimi et al. 2010), in which case our 
approach likely underestimates the ability of the data to 
discriminate among these models. 

Finally, we consider models in which the dark energy 
equation of state is a function of time, according to two 
parametrizations. The first is the commonly used model of 
Chevallier & Polarski (2001) and Linder (2003), 

■w{a) — wo + Wa{l — a), (1) 

where a — 1/(1 + z) is the scale factor, in which the equa- 
tion of state makes a smooth transition from value wq at 
z — to Wo + Wa at high redshift. A generalization due to 
Rapetti et al. (2005), 



has the advantage that the transition redshift, Zt, can be 
marginalized over. (Equation 1 is a special case of Equa- 
tion 2 with Zt = 1 and Wa = Wct ~ wo.) This model has 
greater applicability to current data, which primarily con- 
strain w at z < 1, resulting in more commensurate con- 
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Table 1. Parameters and priors used in the analysis. The nuisance parameters associated with the /gas model (AOS) are not shown, 
although they are also marginalized over. When no entry appears in the prior column, the prior was uniform and significantly wider than 
the marginal posterior for that parameter. Af{fJ,,(y) represents the normal distribution with mean /i and variance , and U{xi,X2) the 
uniform distribution with endpoints xi and X2. For brevity, J\fi{^i, ■ . .) represents the multivariate normal prior used for the Tinker et al. 
(2008) mass function parameters, where n is the marginal mean for each parameter and the covariance matrix is not explicitly shown. 
Notes: " when using CMB data, the angular size of the sound horizon at last scattering replaces ft as a free parameter; ^ indicates priors 
that are not used when CMB data is included in the analysis; indicates parameters that arc free only when CMB data is included; 
indicates that there is an independent parameter of this type for each cluster sample. 



Type 



Symbol Meaning 



Prior 



Cosmology (§3.1) 



Mass function (§3.2) 



Scaling relation (§3.3) 



Other (§3.4, 3.5) 



h 

ln(10"Ms 

T 

w 
Wo 

at 
A 
a 



PO 'Pi 
atm atm 
PO 'Pi 

PUm 



Vg 
VL 



Hubble parameter 

Baryon density ^ 

Cold dark matter density 

Scalar power spectrum amplitude 

Scalar power spectrum slope ^''^ 

Optical depth to rcionization 

Sunyaev-Zcl'dovich signal amplitude 

Constant dark energy equation of state 

Evolving w. current value 

Evolving w: value at early times 

Evolving ui: transition scale factor 

Global amplitude 
Low mass amplitude 
Low mass slope 
Exponential cutoff scale 
Evolution strength 

Nominal luminosity-mass relation 
Nominal temperature-mass relation 
Marginal scaling relation scatters 
Scaling relation scatter correlation 

Chandra temperature calibration 
Cluster gas mass profile logarithmic slope 
Cluster luminosity profile logarithmic slope 
^'soo-to-survey fiux conversion 
Completeness/purity 



A"(0.72,0.08) 
A^(0.0214, 0.002) 



= 0.95 
W(0,2) 



W(0.5,0.95) 

^(0.20,...) 
^(1.52,...) 
A^4(2.25, . . .) 
A/'4(L27,...) 
Ar(l. 0,0.1) 



10% Normal 
A^(l. 092, 0.006) 
A^(0.1135, 0.0005) 

W(0.95, 1.05) 



straints on the current and early-time equation of state, too 
and Wet- In practice, we marginalize over the scale factor of 
the transition, at, within the range 0.5 < at < 0.95. 



3.2 Mass function 

Cosmological analyses of the kind presented here are en- 
abled by the fact that, to good approximation, the expected 
number density of dark matter halos as a function of mass, 
M, can be expressed as a relatively simple function of cos- 
mological parameters. 



dn{M, . 



din (J 



M dM 



(3) 



Here is the mean comoving matter density and is the 
variance of the linearly evolved density field, smoothed by 
a spherical top-hat window of comoving radius r, enclosing 
mass M = 47rpnir^/3, 



fc^P(fc,2)|WA/(fc)|^rf/c, 



(4) 



only through a'^{M,z). The function /((t) may be an an- 
alytic or semi-analytic approximation (Press & Schechter 
1974; Bond et al. 1991; Sheth & Tormen 1999) or a fit to 
cosmological A''-body simulations. 

The applicability of this "universal" form of the mass 
function was first demonstrated in numerical dark matter 
simulations of fiat ACDM and open (SIa = 0) cosmologies by 
Jenkins et al. (2001) and confirmed by Evrard et al. (2002). 
It has since been verified that the fitting function provided 
by Jenkins is approximately accurate (within ~ 20 per cent) 
among models with constant w ^ —l and some evolv- 
ing w models (Klypin et al. 2003; Linder & Jenkins 2003; 
Lokas et al. 2004; Kuhlen et al. 2005).* Other authors have 
studied the dependence of f{cr) on redshift beyond that 
implicit in a'^{M,z) (Lukic et al. 2007; Reed et al. 2007; 
Cohn & White 2008), replacing /(a) with /(o", z). The most 
recent and relevant work is that of Tinker et al. (2008), 
which we adopt here. 

The Tinker fitting function has the form 



+ 1 



c/o 



(5) 



where P(fc, z) is the linear power spectrum evolved to red- 
shift 2 and Whiik) is the Fourier transform of the win- 
dow function. In the formulation of Equation 3, the mass 
function depends on cosmological parameters and redshift 



^ We reiterate that these works include the effects of dark energy 
on the mass function through the cosmic expansion rate, but not 
the effects of dark energy density perturbations (Section 3.1). 
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where each of the fitted parameters has a redshift depen- 
dence of the form 

x{z) = xoil + zY"" ■ x&{A,a,b,c}. (6) 

The various parameters xq and Ox are given in Tinker et al. 
(2008) as a function of the spherical overdensity, A, used to 
define the cluster radius. Unlike the z = mass function, 
this additional redshift dependence has not been tested in 
simulations of cosmologies beyond the simple, flat ACDM 
model. We therefore introduce the parameter e, which con- 
trols the overall strength of the evolution given by the Ox, 
in order to marginalize over remaining uncertainties in the 
redshift dependence of the mass function in exotic cosmolo- 
gies. We also choose to work with the A = 300f2m(2) fit to 
f{a, z) (a relatively large cluster radius) because the evolu- 
tion parametrized by the ax becomes more pronounced with 
increasing overdensity (smaller radius). 

To address the uncertainty in the normalization and 
shape of /(cr, 2 = 0), we marginalized over each of the fit- 
ted parameters in Equation 5 using the covariance matrix 
of the fit (Jeremy Tinker, private communication). The sta- 
tistical error of this fit is < 5 per cent; however, this figure 
does not reflect systematic uncertainties due to the pres- 
ence of baryons (e.g. Stanek et al. 2009), evolving dark en- 
ergy, etc. We therefore scaled the covariance matrix when 
deflning this prior on the mass function parameters such 
that the marginal uncertainty at flxed logj^Q((T~^) — 0.2 
(M ~ 10^^ Af0 in the concordance model) is a conservative 
10 per cent.^ 



3.3 Scaling relations 

To perform the cosmological analysis, we need to relate clus- 
ter mass to the observable that determines cluster detection, 
in this case X-ray flux. Given a redshift, z, a cluster's un- 
absorbed, soft X-ray flux, F, is determined by its intrinsic 
X-ray luminosity, L, temperature, kT, and metallicity, Z, as 



F{z,L,kT, Z) 



L 



indl{z)K(z,kT, Zy 



(7) 



where dL{z) is the luminosity distance to the cluster, and 
K{z, kT, Z) is the required if-correction. For intracluster 
medium temperatures kT > 3keV and luminosities and 
fiuxes in the soft (ROSAT) X-ray band (0.1-2.4 keV), K 
has only a weak dependence on temperature and negligible 
dependence on metallicity; we hereafter fix Z to the typical 
value of 0.3 times the solar value. 

As discussed in Paper II, a simple prescription for how 
L and kT are related to the total mass, M, is given by 
the self-similar model (Kaiser 1986). We define nominal 
luminosity-mass and temperature-mass relations at rsoo of 



^ There is no reason, a priori, that the systematic uncertainty 
in the mass function should have a similar form to the statis- 
tical covariance. At minimum, however, this procedure provides 
a straightforward way to marginalize over a family of functions 
that are similar to the mass function, while still allowing differ- 
ences in the shape as well as the normalization. As we show in 
Section 6.2, uncertainty on the mass function at this level has 
negligible impact on our results, in any case. 



the form 

{£{m)} = 
(t(m)) = 

where 
£ = logi 

m = log^ 



Po + Pi m, 

ntm I ntm 



(8) 



E{z)W^^ ergs-i 



E{z)M5oo 
1015 Me 
kn. 



(9) 



, , f kl 500 \ 



and E{z) is the normalized Hubble parameter, H{z)/Hq. 
The factors oi E{z) appearing explicitly in Equation 9 follow 
from the definition of cluster radius using a fixed overdensity 
with respect to the critical density (Bryan & Norman 1998). 
To describe the intrinsic scatter in I and t given m about 
the nominal relations, we adopt a simple, bivariate normal 
distribution parametrized by the marginal luminosity-mass 
and temperature-mass log-normal scatters, aim and otm, 
and a coefficient of correlation, pitm- 

There are various ways of adding complexity to this 
scaling relation model, including departures from self-similar 
evolution in the normalization of the nominal relations, evo- 
lution in the scatter, and asymmetry in the scatter. In 
Paper II, we show that the data are consistent with the sim- 
ple model defined above, and do not require or prefer any 
such additions, even when the cosmological parameters are 
extremely restricted by external data. We therefore adopt 
the simple model above in this work. 

Motivated by the results of Evrard et al. (2008), we 
have defined the scaling relations for quantities within 
rsoo, which also corresponds to our measurements of 
mass, luminosity and temperature from follow-up obser- 
vations (Paper II). To convert the mass definition used 
by the mass function, A = 300r2m(2), to A = 500, we 
use the procedure of Hu & Kravtsov (2003), assuming a 
Navarro, Frenk, & White (1997, hereafter NFW) mass dis- 
tribution with concentration parameter c = 4. This con- 
version is negligibly sensitive to the assumed concentration 
parameter, since both rsoo and r3oon„,(z) are well beyond the 
NFW scale radius for reasonable values (c > 3; Zhao et al. 
2003, 2009; Gao et al. 2008). 



3.4 Sampling model: follow-up observations 

The next component of the model connects quantities pre- 
dicted from the cosmology, mass function and scaling rela- 
tions to quantities measured from the follow-up X-ray obser- 
vations. Because some of our measurements are made with 
respect to a reference cosmology (see Paper II), this proce- 
dure is not entirely trivial. 

In particular, the mass, luminosity and temperature de- 
termined from the follow-up X-ray observations are mea- 
sured within rsoo , itself determined via the implicit equation 



M(r5oo) 



—? — 71 — r — ^lOt)UJPcr(,2jr500, 

/gas(,rsooj J 



which can be re-written 

Mgas(r) oc pcr(z)r^ fg!^s{r) oc r''", 



(10) 



(11) 
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where r]g is the logarithmic slope of the gas mass profile at 
large radius. Using that pci{z) oc H'^{z), the expression 

rsoooc [/gas(r5oo)i/'W]'^''"'"'' (12) 

relates the "true" value of rsoo predicted by a set of model 
parameters to the value of rsoo that we would have inferred 
assuming our reference cosmology and reference /gas value. 
The gas mass profiles measured in Paper II are self similar, 
consistent with a constant value of 773 ; for simplicity, we 
therefore adopt rjg — 1.092 ± 0.006, determined from a fit 
to the entire sample from 0.7-1.3r5oo, and marginalize over 
the uncertainty. 

To see how the measurements of total cluster mass de- 
pend on model parameters, we first write 



M{r) 



M-^(r)//-f,(r) 

Mgas(r)//gas(r) 



Rnfw 



(13) 



where M'^'^ is the prediction for what mass would be mea- 
sured for a cluster of true mass M using our assumed ref- 
erence parameter values, and d^iz) is the angular diameter 
distance to redshift z. (f^^^z) and /gj' are not predictions 
for measured values, but depend directly on the reference 
cosmology and /gas value. The first term in this expression 
accounts for the dependence of the mass measured within a 
fixed angular aperture on distance and /gas, given that the 
mass is estimated via the gas mass and gas mass fraction, 
and where we have used the scaling Mgas oc dA{z)^'^. The 
dependence on the angular size corresponding to physical 
radius r is handled by iiNFW, which we evaluate assuming 
that the shape of the total mass distribution near rsoo is well 
approximated by the NFW profile; in this case the scaling 
factor is straightforward to compute using the scaling of rsoo 
given in Equation 12: 



-Rnfw ~ 



ln(l -I- XC500) — a::csoo/(l + a;csoo) 



(14) 



ln(l 4- C500) — csoo/(l + csoo) 

where x = rsoo /rsoo- To evaluate this factor, we assume 
a concentration parameter c = 4,^ although we note that 
7?NFW is extremely insensitive to this assumption provided 
that c > 3 (i.e. provided rsoo is well beyond the scale radius). 
Similarly, we can write the scaling of the luminosity as 



Lsooir) oc di(z) 



rsoo 
dA{z) 



(15) 



where dL{z) is the luminosity distance and the first term is 
the scaling of the luminosity measured within a fixed angle; 
the second term is simply the angular size of rsoo raised 
to the power of ijl, the logarithmic slope of the integrated 
luminosity profile. As with the gas mass profiles, we fit this 
slope between 0.7 and l.Srsoo, finding rjL = 0.1135 ± 0.0005 
from the entire sample, and marginalize over the uncertainty. 

Since the emission- weighted, average temperature 
within rsoo is a weak function of rsoo compared with a typi- 
cal statistical error bar, and considering the relative insensi- 
tivity of the cosmological analysis to the precise kT values, 

^ Note that, by convention, c without a subscript refers to C20O1 
defined as r200 in units of the NFW scale radius; c = 4 corre- 
sponds to Csoo ~ 2.6. 



we do not similarly model any dependence of the measured 
temperatures on model parameters. However, we marginal- 
ize over a global 10 per cent Gaussian systematic uncertainty 
on the temperature measurements to account for residual 
uncertainty in instrument calibration. 

Since the masses, luminosities and temperatures from 
the follow-up observations are determined from the same 
data, their statistical error bars are correlated. The masses 
and luminosities are integrated within rsoo, so their er- 
rors are primarily correlated via the uncertainty in rsoo- 
The luminosity and temperature errors are also somewhat 
correlated, since the K-corrections that convert flux to lu- 
minosity have a weak temperature dependence. (There is 
no correlation between mass and temperature errors using 
our method; see Paper II.) Since these measurement errors 
were propagated via Monte Carlo, it is straightforward to 
compute the error correlations from the samples; we find 
Pmi ~ 0-29 ± 0.19 and p^f — —0.4 ± 0.3, averaged over the 
cluster sample. Note that the latter quantity is not related to 
pttm, defined in Section 3.3, which quantifies the correlation 
between departures in luminosity and temperature from the 
nominal scaling relation, not the correlation of measurement 
errors. 



3.5 Sampling model: survey 

The final ingredient is a sampling model for the selection 
of our clusters from the RASS, incorporating the selection 
function of each cluster sample, the survey flux measurement 
errors, and the relation between the flux measurements re- 
ported in each cluster sample and Fsoo , the flux within rsoo 
from follow-up observations (in the same energy band). The 
fluxes reported for each cluster sample are measured differ- 
ently (see Ebeling et al. 1998, 2001; Bohringer et al. 2004) 
and none corresponds to a flux measurement within a fixed- 
overdensity radius such as rsoo- Fortunately, we find a good 
empirical correlation between the reported survey fluxes and 
our fluxes measured from the follow-up observations for clus- 
ters well above the flux limit. ^ We include a simple linear 
conversion between survey flux, Fs, and follow-up observa- 
tion flux, -Fsoo, in the model. 



Fs = K-Fsoo, 



(16) 



where the slope, k, is allowed to be different for each clus- 
ter sample. Because the survey fluxes suffer from Malmquist 
and Eddington biases, the k's must be constrained simulta- 
neously with the full model, including the cosmology and 
scaling relation. These parameters also trivially account for 
any residual uncertainties in the flux cross-calibration be- 
tween ROSAT and Chandra (see Paper II). 

The flux measurement error as a function of reported 
flux for each sample is consistent with Poisson scaling. 



a F oc vFs 



(17) 



although the normalization varies by sample due to the dif- 
ferent algorithms used to measure the flux. We approximate 
the survey flux sampling distribution as Gaussian, centered 
on the true flux, with width determined via Equation 17, 



^ For true fluxes near or below the flux limit, Malmquist bias 
distorts the relation. 
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fitting a separate normalization for each cluster sample di- 
rectly from the reported fluxes and errors. 

Finally, the selection function of each cluster sample, 
the probability that a cluster is included in the sample as 
a function of redshift and measured survey flux, is modeled 
by interpolating the look-up tables provided for each sam- 
ple (Ebeling et al. 1998, 2010; Bohringer et al. 2004). Con- 
servatively, we include an additional nuisance parameter to 
marginalize over possible uniform incompleteness or impu- 
rity at the 5 per cent level, 

Psol(2,Fs) ocC, (18) 
where each sample can have a different value of ^. 



4 ANALYSIS METHOD 
4.1 Likelihood 

Early cosmological work using galaxy cluster samples used 
an approach based around binning the detected clusters 
in redshift and flux (or luminosity) and either adopt- 
ing external priors on the luminosity-mass relation (e.g. 
Borgani et al. 2001) or simultaneously fitting an external 
luminosity-mass data set without accounting for selection 
effects (e.g. Allen et al. 2003).* In M08, we took this ap- 
proach to its logical limit by deriving the likelihood for bins 
of infinitesimal volume; however, that work still suffered 
from the fact that it used an external data set to constrain 
the luminosity-mass relation without explicitly accounting 
for selection bias. Consequently, it was necessary to restrict 
that external data set (Reiprich & Bohringer 2002) to low 
redshifts and high fluxes in order to minimize the effects of 
selection bias, making it impossible to test for departures 
from self-similar evolution in the scaling relation. More re- 
cently, Vikhlinin et al. (2009a, b) binned their detected clus- 
ters in redshift and mass (again with infinitesimally small 
bins) and used the same cluster sample to constrain the scal- 
ing relations; however, their procedure still does not produce 
a self-consistent fit for both scaling relations and cosmology. 

In this section, we show that our procedure from M08 
can be generalized to allow such a simultaneous and self- 
consistent fit, using follow-up observations of fiux-selected 
clusters to constrain the scaling relations over the full red- 
shift range of the data, and accounting fully for the pres- 
ence of Malmquist and Eddington biases. We also show that 
the corresponding likelihood function can be derived from 
first principles, beginning with a simple Bayesian regression 
model. For simplicity, we derive the likelihood for the general 
problem of counting sources as a function of their properties. 
This general picture includes the following components: 

(i) A population function,^ (dN/dx), which provides a 
theoretical prediction for the underlying distribution of 



° We note that some authors have followed the (in principle) 
equivalent approach of using a scaling relation to transform 
the predicted mass function into, e.g. the baryonic mass func- 
tion (Voevodkin &; Vikhlinin 2004) or X-ray temperature function 
(Henry 2004; Henry et al. 2009). Although the statistical formal- 
ism discussed in this section is considered as an extension of the 
flux-redshift binning approach, it also underlies these methods. 
^ Note that this nomenclature is not widely used. We adopt the 



sources (i.e. their number, N) as a function of properties, 
X (see below). 

(ii) Population variables, x, on which the population 
function depends. 

(iii) Response variables, y, which obey a stochastic scaling 
relation as a function of x. 

(iv) The stochastic scaling relation, P{y\x). 

(v) Observed values, x and y. Note that all of the x and 
y need not actually be measured, the exception being those 
which determine whether a source is included in the sample 
(detected; see also Section 4.1.3). 

(vi) Sampling distributions for the observations as a func- 
tion of population and response variables, P{x,y\x,y). 

(vii) A selection function, P{I\x, y, x,y). I represents the 
inclusion of a source in the sample. In full generality, the 
selection function could be a function of true variables as 
well as observations; for example, if it is calibrated using 
reliable simulations. 

Throughout the following, we make the approximation that 
the clustering of sources is unimportant compared with the 
pure Poisson nature of their occurrence, which is justified for 
current all-sky surveys of very massive clusters (i.e. where 
the survey dimensions are large compared with the correla- 
tion length), as in this study (Hu & Kravtsov 2003; Holder 
2006). Provided this assumption remains valid, the approach 
described here can be applied equally well to cluster stud- 
ies based on optical or Sunyaev-Zel'dovich (SZ) surveys. We 
note that this is only a simplifying assumption; in princi- 
ple, the approach detailed below can be straightforwardly 
generalized to include the spatial correlation of sources. 



4-1.1 Binning derivation 

Divide the space {x, y) into bins (indexed by j) of very small 
volume AijAj/j, such that no bin contains more than one 
detected source, and both the population function and scal- 
ing relation are approximately constant over the bin volume. 
The expected number of detected sources in bin j, located 
at {xj,yj), is 

(iVdct,,) = (Ax.Ay,) J dx J dyl^^'jPiylx) 

X P{£j,yj\x,y)P{I\x,y,Xj,yj). (19) 

The likelihood, C, is a product of independent Poisson 
likelihoods for each bin. 



n 



n (^dct..>, 



(20) 



where Nj is the actual number of clusters detected in bin 
j, (A^dct) is the predicted total number of detected sources 
(summed over all bins), and the second equality follows from 
the fact that Nj G {0, 1} by construction. 

Since the binning scheme must be invariant under 



term "population function" to imply an analogy to the mass func- 
tion used in this study. 
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changes to the values of the model parameters, the deriva- 
tive of the log-likelihood is independent of the bin volumes, 
AxjAijj. Defining (ridctj) = {Ndat,j) /{AxjAyj), we can 
write 



(21) 



and ignore the constant of proportionality for the purposes 
of maximizing C. 



4.1.2 



qression derivation 



In the regression of truncated data (where some sources 
are not detected and the total number of sources is thus 
unknown), the total number of detected plus undetected 
sources, A*', becomes a parameter of the model and must be 
marginalized over (for background, see Gelman et al. 2004; 
Kelly 2007). We define (iV), (iVdot) and (iVmis) to be respec- 
tively the predictions for the total number of sources, num- 
ber of detected sources, and number of undetected (missed) 
sources as follows: 



\x) 



(iVdot) 



(iVmis) = (N) - (TVdct) . 

The joint likelihood of the observations and N is 



(22) 



dx / dy P{x,y\x,y)P{I\x,y,x,y), 



£{x,y,N) = 



-{N) 



Nl 



iVdet 
N, 



!iVn 



(23) 



i=l j=l 

The first term is a Poisson likelihood for the model param- 
eter A'', the total number of sources. The second factor is 
a binomial coefficient enumerating the number of ways of 
selecting A^dct detected sources from A*'. This term is nec- 
essary because, in the Bayesian treatment, the true source 
properties (a:, y) axe random variables; as such, sources are 
distinguishable only in terms of their detection (/) or non- 
detection (/). The probability that the A'dot detected sources 
have measurements (x, y) is accounted for by the product 
over detected objects. The second product accounts for the 
likelihood of not detecting an additional A'^i^is sources whose 
properties are distributed according to the population func- 
tion and scaling relation. 

Noting that the marginal probability for a source to 
have properties x is 

{dN/dx) 



P{x)^ 



(TV) 



(24) 



the functions Pdct and Pmis can be related to quantities de- 
fined previously (cf. Equations 19 and 22): 



Pdet{Xi,yi, I) = 



dx 



X Pixi,yi\x,y)P{I\x,y,Xi,yi) 



(25) 



and 

Pmis (/) 



dx I dy^!^Piylx) 



X J dx I dy P{x,y\x,y)P{I\x,y,x,y) 
(N) 



(26) 



Substituting these expressions, the likelihood simplifies to 



£.ix,y,N) 



(NY 



A^dot! 



A^mis! 



(27) 



The first factor above is unity, and the second is a normaliza- 
tion term independent of model predictions. The third term 
is a Poisson density which sums to unity when jC is marginal- 
ized over A'^ from A'det to infinity (equivalently, A'mis from 
zero to infinity). The remaining terms are exactly equivalent 
to those in Equation 21.^° 



4.1.3 Application to the growth of structure 

The formalism of the last sections can be straightforwardly 
applied to observations of the growth of cosmic structure. 
The population variables describing clusters are redshift, 
z, and mass, m, and the population function is simply the 
product of the mass function and the comoving volume ele- 
ment, dN/dzdm — {dn/drn){dV/dz) (see definitions in Sec- 
tion 3). The response variables are luminosity, £, and tem- 
perature, t, which scale with mass and redshift as described 
in Section 3.3. The observations include spectroscopically 
determined redshifts (for which we neglect the measurement 
errors in practice), the masses, luminosities and tempera- 
tures determined from follow-up observations, and the sur- 
vey fiuxes. Their sampling distributions, which account for 
the correlations among m, £ and t measurements from the 
same follow-up data, are discussed in Sections 3.4 and 3.5. 
The selection functions of the cluster samples are given as 
functions of redshift and observed survey flux, and incorpo- 
rate the appropriate sky coverage fraction. 

It is important to note that this analysis method does 
not require all the potentially observed values {x, y} to be 
available for each source, apart from those which determine 
whether a source is included in the sample. This is perhaps 
most apparent from considering how Equation 25 would be 
modified if a particular observable, yo, were unavailable: 
an additional integration over yo would be required, which 



^'^ The normalization factors present in Equations 20 and 27 are 
different because the two likelihood functions are formally defined 
over different domains. What is significant is that both functions 
have the same dependence on model predictions. 

However, the observations do have to span the range of val- 
ues present in the sample reasonably well, in order to adequately 
probe the scaling relations. 
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would result in P(yo\x,y) being replaced by unity.^'^ In the 
binning derivation, this is analogous to the bin containing 
that cluster being unbounded rather than constrained in the 
yo direction. This result is intuitive, as it simply means that 
the integral over yo for that cluster is not weighted by the 
information gained from a measurement. 

Thus, our analysis makes use of the available informa- 
tion for all 238 clusters meeting our selection criteria. For the 
144 clusters without follow-up data, only the redshift and 
survey fluxes are available; following the argument above, 
the terms associated with follow-up luminosity, mass and 
temperature measurements for each of these clusters inte- 
grate to unity. In effect, the likelihood for these clusters (here 
we refer to the terms (ndot,j)) simplify to the form used in 
M08, which depend only on observations of redshift and sur- 
vey flux. For the 94 clusters with follow-up measurements, 
the analysis uses the redshift and survey flux, in addition 
to the (independently measured) follow-up luminosity, mass 
and temperature. The latter are, of course, crucial for ob- 
taining simultaneous constraints on the scaling relations. 

The ability to use partial follow-up in a statistically rig- 
orous way has significance for the planning of large, future 
cluster surveys. In particular, it reveals as overly simplistic 
the choice between strategies employing exhaustive follow- 
up observations to calibrate scaling relations over a rela- 
tively small mass range and those using "self-calibration", 
which use many more sources but rely exclusively on the 
shape of the mass function to provide information about 
the scaling relations. Indeed, obtaining follow-up data for a 
fair sub-sample of the detected clusters potentially allows 
robust calibration of the scaling relations over a wide mass 
range, without unnecessarily limiting the size of the sample 
used for cosmology. However, we note that this attractively 
simple strategy is not necessarily the best; a more complete 
analysis, including an optimization procedure similar to that 
employed by Wu, Rozo, & Wechsler (2010), is warranted in 
planning the follow-up of future surveys. 

Finally, we notice that the procedure described above 
is a conceptually simple generalization of that of M08. In 
fact. Equation 21 is identical to Equation 18 in M08, with 
the exception that our bins are now defined in the higher- 
dimensional space including all observable quantities, rather 
than only survey flux and redshift. 



4.2 Mechanics and priors 

Using the likelihood function detailed in Section 4.1, cosmo- 
logical constraints were obtained via Markov Chain Monte 
Carlo (MCMC), employing the Metropolis sampler embed- 
ded in the COSMOMC code^* of Lewis & Bridle (2002). The 
May 2008 COSMOMC release includes the 5-year WMAP and 
Union supernova data and analysis codes; an additional 



module implementing the /gas analysis has also been pub- 
licly released (Rapetti et al. 2005, A08).^'^ Further modifi- 
cations were made to include the likelihood codes for the 
XLF and BAO data. The CMB and matter power spec- 
trum calculations were performed using the CAMB pack- 
age of Lewis, Challinor, & Lasenby (2000),^^ suitably mod- 
ified to incorporate the evolving w models of Section 3.1 
(Rapetti et al. 2005). 

The set of parameters describing the complete model is 
summarized in Table 1, along with the priors used in our 
analysis. 



4.3 Improvements over M08 

Compared with our previous work in M08, the constraints 
on fiat ACDM and constant w models reported below are 
tighter by a factor of 2-3. Given that we have used essen- 
tially the same cluster samples in the two studies, it is sen- 
sible to address the specific differences in the two analyses 
that result in these improvements. We attribute the change 
to several closely related improvements in our analysis. 

(i) Cluster mass determination: In M08, we calibrated 
the X-ray luminosity-mass relation using the HIFLUGCS 
data of Reiprich & Bohringer (2002), for which masses were 
estimated from X-ray data, assuming hydrostatic equilib- 
rium. This procedure is known to underestimate the mass 
by tens of per cent (e.g. Nagai et al. 2007; Mahdavi et al. 
2008), since the intracluster medium is not supported solely 
by thermal pressure. However, the overall size of this effect 
is not well established, nor is the variation in non-thermal 
pressure from cluster to cluster. Consequently, that earlier 
work was forced to incorporate a large systematic uncer- 
tainty in the mass measurements, which significantly de- 
graded constraints on the scaling relation and its intrinsic 
scatter, resulting in a corresponding degradation in cosmo- 
logical constraints. 

The present work avoids this issue by using gas mass as 
a proxy for total mass (see details in Paper II). Unlike total 
mass, gas mass can be measured from X-ray data with very 
little bias. The relation between gas mass and total mass 
is provided by A08; ultimately, the cluster mass scale is set 
using the hydrostatic method for the clusters in that work, 
i.e. the class of hot, dynamically relaxed clusters for which 
the bias due to non-thermal pressure is minimal. 

(ii) Follow-up data over a range of redshifts: The statisti- 
cal methods employed in M08 lack the internal consistency 
of our new procedure (Section 4.1), making it impossible 
to rigorously incorporate follow-up data at redshifts much 
greater than zero.^^ As a result, we could not directly con- 
strain the evolution of the scaling relations, and instead 
marginalized over a range of possibilities, using conservative 
priors. With our new method, follow-up data spanning the 



This simplification occurs because the selection function is nec- 
essarily not dependent on yg. Similarly, in practice. Equation 26 
need be integrated only over observables that influence selection. 

Sahlen et al. (2009) demonstrate the improvement in cosmo- 
logical constraints available to future surveys by incorporating 
follow-up observations. 

http: / /cosmologist . inf o/cosmomc/ 



http : //www. Stanford. edu/~ drapetti/f gas_module/ 
http://www.cajnb.info 

In detail, the follow-up data were restricted to low redshifts 
{z < 0.11) where the HIFLUGCS discovery space was large, i.e. 
where many clusters were found well above the flux limit. This 
procedure reduces the biasing effects of flux selection, although it 
does not eliminate them. 
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Figure 1. Left: Joint 68.3 and 95.4 per cent confidence regions for parameters of the ACDM cosmology from the BCS (blue), REFLEX 
(green) and Bright MACS (red) cluster samples individually, including all systematic allowances in Table 1. Note that only the 95.4 per 
cent confidence regions are visible for BCS and REFLEX. Right: Constraints from the full XLF data set (purple) and 5-year WMAP 
data (blue; Dunkley et al. 2009, marginalized over the SZ signal). Results from the combination of the XLF and WMAP5 data are shown 
in gray. 



full redshift range of cluster detections can be included, al- 
lowing evolution in the scaling relations to be tested directly 
(Paper II) . Relatedly, the distribution of follow-up data over 
a range in redshift improves the constraints on the dark en- 
ergy equation of state. 

(iii) Use of /gas data: As discussed by Vikhhnin et al. 
(2009b), the method used to estimate masses has an ef- 
fect on the constraints obtained because different mass prox- 
ies have different dependencies on distance, with gas mass 
(Mgas oc d^'^) being more sensitive than temperature or Yx- 
Our results are therefore sensitive to this choice at some 
level. 

Another way to look at this is to notice that the AOS data 
that we use to constrain /gas inherently contain informa- 
tion about Qrn in addition (given a bound on the baryonic 
depletion of clusters). Thus, our results on Q m are primar- 
ily, though not entirely, driven by these /gas data. With the 
value of /gas constrained, the XLF data then determine erg 
and w. Note that, as mentioned in Section 2, we use only six 
clusters from AOS at redshifts z < 0.15; due to this redshift 
restriction, these /gas data do not produce a constraint on 
w by themselves. 

Although it is not hsted explicitly above, we emphasize 
that the new statistical method outlined in this section is 
ultimately the most important advantage over M08, since 
the improvements in follow-up data and mass determination 
cannot be fully or fairly exploited without it. 



15 




0.1 0.2 0.3 0.4 0.5 

Figure 2. Marginalized posterior distributions for f2m in the 
ACDM model from analysis of the XLF data (purple, short 
dashed line; including systematic allowances in Table 1), 5-year 
WMAP data (blue, solid line; Dunkley et al. 2009, marginalized 
over the SZ signal), cluster /gas (red, long dashed line; AOS, in- 
cluding conservative systematic allowances), SNIa (green, 3-dot- 
dashed line; Kowalski et al. 2008, including their treatment of 
systematics) , and BAO (brown, dot-dashed line; Percival et al. 
2007, also using our standard priors on h and Hb/i'^). Note that 
the XLF and /gas results are not independent (Sections 2 and 
4.3). 



5 RESULTS 

5.1 Constraints on the ACDM model 

For a spatially flat, cosmological constant {w = —1) model, 
the joint constraints on fim and erg obtained from the BCS, 
REFLEX, and Bright MACS cluster samples individually 
are displayed in the left panel of Figure 1. Results from 



combining the 3 samples appear as purple contours in the 
right panel; the constraints, marginalized over the system- 
atic allowances listed in Table 1, are ilm = 0.23 ± 0.04 
and (78 = 0.82 ± 0.05 (Table 2). These results agree well 
with the tight constraints obtained from WMAP5 data 
(Dunkley et al. 2009) for this model (blue contours in the 
right panel). Results from combination of WMAP5 and 
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Figure 3. Joint 68.3 and 95.4 per cent confidence regions for parameters of tlie constant w model. Left: constraints on Qm and w 
from the XLF (purple, including all systematic allowances in Table 1) are compared -with those from cluster /gas data (red; AOS, 
including conservative systematic allowances), 5-year WMAP data (blue; Dunkley et al. 2009, marginalized over the SZ signal), SNIa 
data (green; Kowalski et al. 2008, including their treatment of systematics) , and BAO observations (brown; Percival et al. 2007, also 
using our standard priors on h and Q]jh'^). Results from combining these 5 data sets are shown in gold. Right: Constraints on erg and w 
from XLF and WMAP5 data. The combination of the XLF and WMAP5 yields the gray contours; adding the other data listed above 
produces the gold contours. 



the XLF (gray contours) are somewhat improved: ilm = 
0.26 ± 0.02 and erg = 0.80 ± 0.02. Our XLF results are in 
agreement with recent estimates based on other X-ray se- 
lected (Henry et al. 2009; Vikhhnin et al. 2009b) and opti- 
cally selected (Rozo et al. 2010) cluster samples, and a va- 
riety of independent cosmological data (Percival et al. 2007, 
2010; AOS; Fu et al. 2008; Ho et al. 2008; Kowalski et al. 
2008; Hicken et al. 2009; Reid et al. 2010, see also references 
in M08). 

As mentioned in Section 3.1, the scalar spectral index. 
Us, is fixed at 0.95 in our analysis of the XLF data alone. 
The only parameter that is significantly degenerate with Us 
is as', our results for values of ris other than 0.95 can be 
adequately described by shifting the erg constraints along 
the linear relation ag = 0.82 + 0.25(ns - 0.95). 

The marginalized posterior distribution for Qyn ob- 
tained from the XLF data is compared with those ob- 
tained from the analysis of WMAP5, /gas (A08), SNIa 
(Kowalski et al. 2008), and BAO (Percival et al. 2007, also 
using our standard priors on h and Qhh^) data in Figure 2. 
The agreement among all the data sets is good, though note 
that the XLF and /gas results are not independent (Sec- 
tions 2 and 4.3). 

In Paper II, we show that this simple cosmological 
model (and the simple scaling relation model of Section 3.3) 
provides an acceptable fit to the XLF data. Constraints on 
the parameters describing the scaling relations are also pre- 
sented in that work. 



5.2 Constraints on constant w models 

For models in which the dark energy equation of state, w, 
is constant in time, the joint constraints on flm and w from 
the XLF are shown in the left panel of Figure 3 (purple 
contours). The constraint on the equation of state from the 
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Figure 4. Joint 68.3 and 95.4 per cent confidence regions for 
parameters of the constant w model, including conservative sys- 
tematic allowances. Purple contours indicate constraints from the 
XLF (which includes six z < 0.15 /gas clusters; see Sections 2 and 
4.3), while constraints from all 42 /gas clusters alone are shown 
in red. Results combining the XLF data with all 42 /gas clusters 
appear in green. The combination of these two types of cluster 
data with standard priors on h and n^/i^ yields a competitive 
constraint on dark energy, w = —1.06 ± 0.15. 

XLF alone, which is driven by the evolution in the cluster 
mass function, is ui = —1.0 ± 0.2 (including systematics), a 
factor of 3 improvement over M08. Our results on f2m and 
erg are nearly identical to the ACDM case (Table 2); these 
constraints are driven by the cluster data at low redshift 
and are thus largely insensitive to the properties of dark en- 
ergy. Figure 3 also demonstrates the agreement of our results 
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Figure 5. Joint 68.3 and 95.4 per cent confidence regions, including conservative systematic allowances, for parameters of the evolving w 
models. Left: constraints on the {wq , Wa) model (Equation 1) from the combination of XLF and 5-year WMAP data are shown in gray, as 
well as the combination of XLF, WMAP5 (Dunkley et al. 2009), cluster /gas (A08), SNIa (Kowalski et al. 2008) and BAO (Percival et al. 
2007) data (gold). Right: Constraints on the {wQ,Wet) model (Equation 2). The transition scale factor is marginalized over the range 
0.5 < at < 0.95. Crosses in each panel indicate the ACDM model, with constant w = —1. 



with independent constraints from other cosmological data 
(Section 2), including WMAP5 (blue), SNIa (green), clus- 
ter /gas (red) and BAO (brown). The combination of these 
data sets appears as gold contours in the figure; the im- 
proved, marginalized constraints from this combination are 
= 0.27 ± 0.02, 0-8 = 0.79 ± 0.03 and w = -0.96 ± 0.06. 
Our results are also consistent with those derived indepen- 
dently by Vikhlinin et al. (2009b) from X-ray flux-selected 
clusters. 

The constraints in the as-w plane from the XLF and 
WMAP5 data are shown in the right panel of Figure 3. 
The complementarity of the two data sets is evident, with 
the XLF constraint on erg breaking an important degen- 
eracy affecting the CMB data. The combination of only 
the XLF and WMAP5 data yields the grey contours in 
the figure, corresponding to the one-dimensional constraints 

= 0.27 ± 0.04, erg = 0.78 ± 0.04 and w = -0.95 ± 0.14. 

We note that the combination of the two types of clus- 
ter data used here, XLF and /gas (with standard priors on h 
and fib/i^), also places competitive constraints on dark en- 
ergy: w — — 1.06±0.15 (Figure 4). Interestingly, this result is 
comparable to that of the combination of XLF with WMAP5 
data, or /gas with 3-year WMAP, Cosmic Background Im- 
ager, Arcminute Cosmology Bolometer Array Receiver, and 
BOOMERanG data {w = -1.00 ± 0.14; AOS). 

As discussed in Section 3.1, our evaluation of the lin- 
ear matter power spectrum for models with w 7^ — 1 includes 
the effects of dark energy density perturbations. For compar- 
ison with other works where dark energy perturbations are 
neglected, we also performed an analysis where dark energy 
affects only the expansion of space (i.e. where it has uniform 
density) despite not being a cosmological constant. In this 
case, we obtain from the XLF the slightly larger constraint 
w = —0.97 ± 0.25, including all systematic uncertainties in 
the usual way. 



5.3 Constraints on evolving w models 

Constraints on the two evolving w models discussed in Sec- 
tion 3.1 (Equations 1 and 2), (wojWa) and (lUojTOct), are 
shown respectively in the left and right panels of Figure 5. 
Results from the combination of XLF and WMAP5 data are 
shown as gray contours, and the combination of those two 
with /gas, SNIa and BAO data is shown in gold. In both 
cases, there is no evidence for evolution in the dark energy 
equation of state, and the results are consistent with the 
cosmological constant model (indicated by a cross in the fig- 
ure). Marginalized constraints are listed in Table 2. For the 
more general (wq, Wct) model, we find wo = — 0.88±0.21 and 
Wet ~ -1-05^0 36 from the combination of all the data (Fig- 
ure 6). These results are a significant improvement over pre- 
vious constraints on this model from the combination of /gas, 
CMB and SNIa data {wo = -1.05^o.26, Wct = -0.83to;43; 
AOS). 

Adding spatial curvature as a free parameter and fixing 
the transition redshift to 1 (Equation 1), we obtain con- 
straints equivalent to [a{wp)a{wa)]^^ = 15.5, using the 
notation and definitions of the Dark Energy Task Force 
(DETF; Albrecht et al. 2006). 



6 INVESTIGATION OF SYSTEMATICS 
6.1 Sensitivity to priors 

Our analysis of the XLF data alone includes priors to con- 
strain the Hubble parameter and mean baryon density, and 
to marginalize over systematic allowances on the mass func- 
tion, cluster sample completeness and cluster gas mass frac- 
tion (Table 1). To investigate the influence of each prior 
individually, we importance sampled the Markov chains pro- 
duced in the XLF analysis for the constant w model, reduc- 
ing the width of each prior in turn by a factor of two. If 
importance sampling a particular prior in this way changes 
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Figure 6. Joint 68.3 and 95.4 per cent confidence regions from 
the combination of XLF, /gas, WMAP5, SNIa and BAO data, 
including systematic uncertainties, for parameters of the (uiq, Wct) 
model (Equation 2) . The results are the same as in the right panel 
of Figure 5, but with both wq (green contours) and Wet (blue) 
shown on the j;-axis, against the dark energy density, Qde, on 
the X-axis. The transition scale factor is marginalized over the 
range 0.5 < at < 0.95. The horizontal, dotted line indicates the 
cosmological constant model (uiq = uiot = — 1). 
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Figure 7. Joint 68.3 and 95.4 per cent confidence regions for Qrn 
and erg in the constant w model from the XLF data, including the 
standard systematic allowances in Table 1, are shown in purple. 
The blue contours result from reducing the width of all of the al- 
lowances associated with the /gas model of A08 by a factor of two. 
The figure demonstrates that our results on Qm and trg are lim- 
ited by the systematic uncertainty in /gas . (These allowances have 
no effect on the w constraint.) Improvements in hydrodynamical 
simulations and the incorporation of gravitational lensing mass 
measurements offer the possibility of significantly reducing the 
uncertainty in /gas, and thus improving the constraints. 



the posterior distribution for parameters of interest, we can 
conclude that the prior is influential. 

Of the priors listed above, only the systematic al- 
lowances associated with the determination of /gas are sig- 
nificant. Since the gas mass fraction determines the overall 
mass scale when Mgas is used as a proxy for total mass, 
its systematic uncertainty affects primarily the constraints 
on nm and as, as shown in Figure 7. Following AOS, our 
standard analysis uses conservative systematic allowances 
for Chandra calibration (10 per cent), non-thermal pressure 
support (10 per cent), the depletion of baryons in clusters 
with respect to the cosmic mean (20 per cent), and evolu- 
tion with redshift in the baryon fraction and stellar content 
of clusters (10 and 20 per cent). The most significant for this 
work are on the depletion of baryons in clusters, which de- 
termines the width of the Qm constraint, and the amount of 
non-thermal pressure and overall calibration of Chandra for 
temperature measurements, which determine the constraint 
on as at fixed Qm- 

In contrast, none of the priors affects the constraint on 
w significantly. This was confirmed by an additional analysis 
in which all of the systematic uncertainties were eliminated, 
i.e. every nuisance parameter was fixed rather than being 
marginalized over. Ordinarily, this would indicate that our 
constraints on dark energy are statistically limited, and this 
may plausibly be the case; however, we note that our con- 
straints may also be limited by the fact that the effect of 
dark energy perturbations on the mass function is not yet 
understood. If these perturbations produce changes in the 
high-mass tail of the mass function at the few tens of per 
cent level, there may be additional constraining power avail- 
able from the current XLF data. 



6.2 Choice of mass function 

In our standard analysis, we have used the mass function of 
Tinker et al. (2008) with cluster radius defined by an over- 
density of 300f2m(2). One could also justify using the mass 
function determined at a cluster radius closer to the radius 
where the follow-up mass measurements are made, rsoo- We 
therefore repeated the XLF analysis using the mass func- 
tion given by Tinker et al. for overdensity 1600r2m(2). For 
fim values of 0.2-0.3 and z < 0.5, this corresponds to over- 
densities of 300-1000, compared with 50-200 for 300nm(z). 
Our results with this alternative mass function were virtu- 
ally identical to the standard results. 

Similarly, we repeated the analysis using the older mass 
function of Jenkins et al. (2001) at overdensity 324ilrn(z). 
Here we used a 20 per cent Gaussian systematic allowance 
on only the normalization of the mass function and fixed 
the shape, as in M08. Again, the cosmological results were 
essentially identical to our standard results, despite the fact 
that the Jenkins function does not include evolution with 
redshift, while the Tinker function does. This indifference 
to the details of the mass function is consistent with the 
observation in Section 6.1 that the current results are insen- 
sitive to the systematic allowances on the mass function and 
its evolution. 

6.3 Note on Chandra calibration 

A major, recent (21 January 2009) update to the Chandra 
ACIS effective area at soft energies is accounted for in our 
analysis of follow-up Chandra observations for the XLF data 
(see Paper II). However, the raw data analysis in AOS pre- 
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Table 2. Marginalized 68.3 per eent eonfidence intervals on cosmological parameters from our analysis, including all systematic uncer- 
tainties in Table 1. The various cosmological models are discussed in Section 3.1. "All" refers to the combination of XLF, cluster /gas, 
WMAP5, SNIa and BAO data. 



Data 


Model 




C8 




t^a , tf et 


XLF 
XLF+WMAP5 
all 


ACDM 
ACDM 


0.26 ± 0.02 
0.257 ±0.015 


U.O^ U.UiJ 

0.80 ±0.02 
0.80 ±0.02 


— 
— 







XLF 
XLF+WMAP5 

XLF + /gaa 

all 


constant w 
constant to 
constant w 
constant w 


0.23 ± 0.04 
0.27 ±0.04 
0.22 ± 0.04 
0.272 ± 0.016 


0.82 ± 0.05 
0.78 ± 0.04 
0.83 ± 0.05 
0.79 ± 0.03 


-1.01 ± 0.20 
-0.95 ± 0.14 
-1.06 ± 0.15 
-0.96 ± 0.06 




XLF+WMAP5 


evolving w (wa) 


0.26 ± 0.04 


0.79 ± 0.05 


-0.77 ±0.31 


-o.34t?;S 


all 


evolving to (wa) 


0.256 ± 0.016 


0.80 ±0.03 


-0.93 ± 0.16 


-0 


XLF+WMAP5 


evolving w {wct) 


0.26 ± 0.04 


0.79 ± 0.05 


-0.73 ± 0.40 


-1 10+'''^^ 


all 


evolving w (tOct ) 


0.257 ±0.016 


0.80 ±0.03 


-0.88 ± 0.21 


J^-UO_Q 3g 



dates this calibration update, which in our tests typically 
results in an increase in the inferred gas fraction of ~ 10 per 
cent. We have accounted for this correction by shifting the 
center of the Chandra calibration" nuisance parameter in 
the AOS model such that the preferred value of /gas increases 
correspondingly by 10 per cent. The Gaussian prior on this 
parameter also has a width of 10 per cent, so this systematic 
allowance encompasses both the old value and the new value 
expected from the calibration update. 

The effect of this higher gas mass fraction is to shift the 
Jim constraint from the /gas analysis to lower values by ap- 
proximately 0.02. A corresponding shift appears in the XLF 
results, since the fim constraints are partially driven by our 
use of a subset of the /gas data to set the overall cluster 
mass scale. The correction additionally results in a shift of 
similar magnitude towards lower as values, but has no effect 
on the determination of w. While we expect that the 10 per 
cent Gaussian allowance adequately reflects the systematic 
uncertainty in the overall Chandra calibration, these trends 
should be kept in mind in interpreting our results. For exam- 
ple, if we have overestimated the effect that the calibration 
update has on the value of /gas , our best fit results will shift 
slightly to higher f2,n and ug. 



7 CONCLUSION 

We have presented cosmological constraints obtained from 
an X-ray flux-limited sample of 238 massive galaxy clusters 
spanning the redshift range z < 0.5. Follow-up Chandra or 
ROSAT X-ray observations of 94 of these clusters are in- 
corporated, as detailed in Paper II. Our analysis accounts 
for all selection biases, includes conservative allowances for 
systematic uncertainties, and, for the first time, produces si- 
multaneous constraints on cosmology and the cluster scaling 
relations using a rigorous and fully self-consistent statistical 
method. The incorporation of follow-up data and our im- 
proved analysis method result in cosmological constraints 
that are a factor of 2-3 better than our previous results in 
M08, which were based on the same flux-limited sample of 
clusters. The results presented here are among the tight- 



est and most robust constraints on cosmological parameters 
available from current data. 

The constraints on spatially flat, cosmological constant 
models from our XLF data are fim = 0.23 ± 0.04 and 
(T8 = 0.82 ± 0.05. Introducing a constant dark energy equa- 
tion of state, w, as a free parameter, we find w = — 1.0±0.2, 
obtaining the same tight constraints on f2m and ag. These 
results confirm at higher precision the first constraints on the 
dark energy equation of state from the analysis of XLF data 
reported by M08, and are consistent with independent find- 
ings based on cluster gas mass fractions, CMB anisotropics, 
type la supernovae, baryon acoustic oscillations, galaxy red- 
shift surveys, and cosmic shear. We also find good agreement 
with recent, independent analyses of X-ray selected galaxy 
clusters (Henry et al. 2009; Vikhlinin et al. 2009b) and op- 
tically selected clusters (Rozo et al. 2010). 

Using the combination of XLF, /gas, WMAP5, SNIa 
and BAO data, we investigate evolving w models, including 
the standard {wo,Wa) model as well as a (iuo,'''ct) model in 
which the transition scale factor, at, is marginalized over. 
In both cases, the best current cosmological data remain 
consistent with a constant value of ui = —1. The con- 
straints on the non-flat {wo,Wa) model from the combina- 
tion of current data are equivalent to DETF flgure of merit 
[a{wp)a{wa)]~^ ~ 15.5. Compared with the flgure of merit 
of 8.3 reported by Wang (2008) from the combination of 
WMAP5, SNIa and BAO data, this result highlights the 
contribution of cluster data (XLF and /gas), and their com- 
plementarity to other cosmological information. 

Currently, the most signiflcant systematic limitation on 
our results for Qm and ag is systematic uncertainty in the 
cluster gas mass fraction, /gas. Speciflcally, the dominant 
uncertainty in the determination of f2m is in the depletion 
of baryons in clusters relative to the cosmic mean, for which 
a 20 per cent systematic allowance is adopted in the present 
work. The prospect of signiflcant improvement here exists, 
as numerical simulations incorporate more complete treat- 
ments of baryonic physics, especially feedback and star for- 
mation processes in the centers of clusters. The results on 
erg are limited by our knowledge of the non-thermal pressure 
in clusters and the overall calibration of X-ray temperature 
measurements; cluster mass measurements via gravitational 
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lensing provide an observational means of circumventing this 
issue, by directly constraining the overall mass scale of the 
data. 

XLF results on dark energy remain statistically lim- 
ited. The incorporation of additional X-ray, SZ (e.g. Planck, 
the South Pole Telescope and the Atacama Cosmology Tele- 
scope) and optical/near-infrared (e.g. the Dark Energy Sur- 
vey and Pan-STARS) cluster surveys could lead to signifi- 
cant near-term improvements, although we stress that these 
surveys must have well-understood selection functions, sam- 
pling models, and scaling relations. The possibility also re- 
mains that cosmological simulations of the mass function 
including the effects of dark energy perturbations will re- 
veal additional constraining power. It is therefore important 
to pursue a theoretical understanding of the effects of fluid 
dark energy on nonlinear structure formation, even as much 
larger X-ray (eROSITA^*), and optical (e.g. the Large Syn- 
optic Survey Telescope) cluster surveys promise to increase 
the data available for cluster cosmology by orders of magni- 
tude. 

We note that the statistical approach described in this 
work has significance for the planning of follow-up observa- 
tions for future cluster surveys. Our method permits the use 
of partial follow-up, allowing scaling relations to be directly 
and robustly constrained over a wide range in mass without 
raising the cost in exposure time prohibitively, as would be 
the case for a complete, exhaustive follow-up campaign. At 
the same time, the incorporation of follow-up data provides a 
significant advantage over pure self-calibration approaches. 
In detail, any plan for X-ray follow-up observations of fu- 
ture surveys should also account for the additional benefit of 
observing dynamically relaxed clusters suitable for the /gas 
test; as we have shown, the combination of cluster /gas data, 
which directly probe the expansion of the Universe, and the 
XLF, which measures the growth of structure, can provide 
precise cosmological constraints independent of CMB and 
SNIa observations. 

Readers interested in the simultaneous constraints on 
cluster scaling relations produced in this work should refer 
to Paper II. MCMC samples encoding the results of these 
two papers will be made available for download on the web.^^ 
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